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(54) Information processing apparatus and methods 

(57) In a data processing apparatus, data is trans- 
ferred in accordance with a meta-instruction embedded 
in the data. Initially, a first meta-instruction is read out 
from an address ADDR0 stored in a tag address register 
Dn_TADR. Then, data following the first meta-instruc- 
tion with a length specified by the meta-instruction is 
transferred. Subsequently, a second meta-instruction 
stored at an address ADDR2 specified in the first meta- 
instruction is read out and data following the second me- 
ta-instruction with a length specified by the second me- 
ta-instruction is transferred. A third meta-instruction 
stored at an address ADDR1 specified in the second 
meta-instruction is further read out and data following 
the third meta-instruction with a length specified by the 
third meta-instruction is then transferred. 
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Description 



The present invention relates to information 
processing apparatus and information processing meth- 
ods. More particularly, the present invention relates to 
such information processing in which the order and the 
priority of transfer of data is controlled. 

The performance of home entertainment systems 
and personal computers has been improving due to 
higher frequencies of the processing LSIs (large scale 
integrated circuits) and the increased scale of circuit in- 
tegration thereof. In order to further enhance the 
processing performance of the systems, a plurality of 
processing devices such as a graphics processor and 
an image thawing processor are integrated into a single 
LSI for implementing a system which is capable of car- 
rying out processing in parallel. With such a single LSI, 
however, it is the memory cost that remains to be re- 
duced. Thus, a memory is shared by the processing de- 
vices as a way of reducing the memory cost. A config- 
uration in which a memory is shared by the processing 
devices is referred to as a UMA (Unified Memory Archi- 
tecture). 

In comparison with the increased operating fre- 
quencies of processing LSIs and the increased scales 
of circuit integration thereof, however, the storage ca- 
pacity of a general memory is not increased so much in 
comparison with the conventional memory manufac- 
tured so far at the same cost, giving rise to a problem 
that the storage capacity of the memory remains con- 
stant, a problem in the performance improvement that 
remains to be solved. 

In addition, also in an architecture wherein a mem- 
ory is shared by a plurality of processors such as the 
UMA, there is the problem that the speed of access to 
the memory is slowed down, a problem in the perform- 
ance improvement that remains to be solved. 

Addressing the problems described above, it is an 
aim of at least a preferred embodiment of the present 
invention to provide an information processing appara- 
tus and an information processing method that allow da- 
ta to be processed efficiently without requiring an exces- 
sive storage capacity of the memory by embedding a 
control instruction in the data itself. 

According to a first aspect of the invention there is 
provided an information processing apparatus wherein: 
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is provided an information processing apparatus where- 
in: 

a bus and a memory are shared on a time division 
5 basis by a plurality of processing units for carrying 
out 3-dimensional graphics processing in parallel; 
and said memory is used for storing a list including 
data for said 3-dimensional graphics processing to 
be transferred to said processing units and an in- 
10 struction for controlling transfers of said data to said 
processing units. 

According to a third aspect of the invention there is 
provided an information processing apparatus wherein: 

15 

a bus and a memory are shared on a time division 
basis by a plurality of processing units for carrying 
out 3-dimensional graphics processing in parallel; 
said information processing apparatus provided 

20 with a data transferring means used for reading out 
a list including data for said 3-dimensional graphics 
processing to be transferred to said processing 
units and an instruction for controlling transfers of 
said data to said processing units from said memory 

25 and used for transferring said data to said process- 
ing units in accordance with said instruction. 

According to a fourth aspect of the invention there 
is provided an information processing apparatus where- 
30 in: 

a bus and a memory are shared on a time division 
basis by a plurality of processing units for carrying 
out 3-dimensional graphics processing in parallel; 

35 said information processing apparatus provided 
with a list generating means used for generating a 
list including data for said 3-dimensional graphics 
processing to be transferred to said processing 
units and an instruction for controlling transfers of 

40 said data to said processing units and used for stor- 
ing said list in said memory; and 
provided with a data transferring means used for 
reading out said list from said memory and used for 
transferring said data on said list to said processing 

45 units in accordance with said instruction included 
on said list. 



a bus and a memory are shared on a time division 
basis by a plurality of processing units for carrying 
out 3-dimensional graphics processing in parallel; so 
said information processing apparatus provided 
with a list generating means for generating a list in- 
cluding data for said 3-dimensional graphics 
processing to be transferred to said processing 
units and an instruction for controlling transfers of 
said data to said processing units. 

According to a second aspect of the invention there 



According to a fifth aspect of the invention there is 
provided an information processing method whereby: 

a bus and a memory are shared on a time division 
basis by a plurality of processing units for carrying 
out 3-dimensional graphics processing in parallel; 
said information processing method comprising the 
steps of: 

generating a list including data for said 3-di- 
mensional graphics processing to be trans- 
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ferred to said processing units and an instruc- 
tion for controlling transfers of said data to said 
processing units and storing said list in said 
memory; 

reading out said list from said memory; and 
transferring said data on said list to one of said 
processing units in accordance with said in- 
struction included on said list. 

The invention will now be described by way of ex- 
ample with reference to the accompanying drawings, 
throughout which like parts are referred to by like refer- 
ences, and in which: 

Fig. 1 is a plan diagram showing a typical home en- 
tertainment system to which an information 
processing apparatus embodying the present in- 
vention may be applied; 

Fig. 2 is a diagram showing a front view of the home 

entertainment system shown in Fig, 1 ; 

Fig. 3 is a diagram showing a side view of the home 

entertainment system shown in Fig. 1 ; 

Fig. 4 is a plan diagram showing a typical CD-ROM, 

from which information is played back in the home 

entertainment system shown in Fig. 1; 

Fig. 5 is a block diagram showing a typical internal 

electrical configuration of the home entertainment 

system shown in Fig. 1 ; 

Fig. 6 is a block diagram showing a detailed config- 
uration of a main DMAC, a main CPU, the main 
memory, a second vector processing engine 
(VPE1) and a GPU shown in Fig. 5; 
Figs. 7A and 7B are diagrams showing a procedure 
for processing display lists generated by a plurality 
of processors; 

Fig. 8 is a block diagram showing another typical 

configuration of the home entertainment system in 

which three processors control the GPU; 

Fig. 9 is a diagram showing a typical format of a 

meta-instruction; 

Fig. 1 0 is a diagram used for explaining a procedure 
for transferring data in accordance with meta-in- 
structions; 

Fig. 11 is a diagram used for explaining another pro- 
cedure for transferring data in accordance with me- 
ta-instructions; 

Fig. 1 2 is a diagram used for explaining a still further 
procedure for transferring data in accordance with 
meta-instructions; 

Fig. 1 3 is a diagram used for explaining a procedure 
for transferring data in accordance with a display 
list; and 

Fig. 1 4 is a diagram used for explaining stall control. 

Figs. 1 to 3 are each a diagram showing a typical 
home entertainment system to which an information 
processing apparatus embodying the present invention 
may be applied. As shown, the system comprises an 
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entertainment system main unit 2 in addition to an op- 
eration unit 1 7 and a recording unit 38 which can be con- 
nected to the entertainment system main unit 2. 

As shown in Figs. 1 to 3, the entertainment system 

s main unit 2 has an all but square shape. The entertain- 
ment system main unit 2 comprises a disc mounting 
sub-unit 3 for mounting a CD-ROM (compact disc read 
only memory) 40 located at the center thereof, a reset 
switch 4 located at a proper position on the entertain- 

10 ment system main unit 2 for use by the user to arbitrarily 
resetting a running application, a power supply switch 5 
for use by the user to turn on and off a power supply, a 
disc operation switch 6 for use by the user to mount a 
disc on the disc mounting sub-unit 3 and connectors 7A 

is and 7B on the right and left respectively for use by the 
user to connect the entertainment system main unit 2 to 
the operation unit 17 which is used for carrying out op- 
erations while an application is running and the record- 
ing unit 38 for recording the setting various kinds of in- 

20 formation of the running application and the like. It 
should be noted that the CD-ROM is a kind of optical 
disc like one shown in Fig. 4.. The CD-ROM is a disc 
used as a recording medium of a running application. 
As shown in Figs. 2 and 3, the connectors 7A and 

25 7B are each designed into two levels. At the upper level 
of each of the connectors 7A and 7B, a recording insert 
portion 8 for connecting the entertainment system main 
unit 2 to the recording unit 38 is provided. At the lower 
level of each of the connectors 7A and 7B, on the other 

so hand, a connector pin insert portion 12 for connecting 
the entertainment system main unit 2 to the operation 
unit 17 is provided. 

The recording insert portion 8 comprises a horizon- 
tal long rectangular insert hole and a memory terminal 

35 into which the recording unit 38 is inserted. The memory 
terminal is placed inside the hole and not shown in the 
figure. With the recording unit 38 not connected to the 
entertainment system main unit 2, the recording insert 
portion 8 is covered by a shutter 9 for protecting the 

40 memory terminal against dust and the like as shown in 
Fig. 2. It should be noted that the recording unit 38 has 
an electrically programmable ROM into which the main 
CPU 44 records data of application software. 

When mounting the recording unit 38 on the enter- 

45 tainment system main unit 2, the user pushes the shutter 
9 toward the inside of the recording insert portion 8 by 
using the end of the recording unit 38, further inserting 
the recording unit 38 into the insert hole till the recording 
unit 38 gets connected with the memory terminal. 

50 As shown in Fig. 2, the connector pin insert portion 
12 comprises a horizontal long rectangular insert hole 
and a connector terminal 12A for connecting the con- 
nector insert portion 12 to a connector terminal portion 
26 of the operation unit 17. 

55 As shown in Fig. 1 , the operation unit 1 7 has a struc- 
ture that can be held and sandwiched by both the hands 
of the user and can be operated by free movements of 
the five fingers of each hand. The operation unit 1 7 com- 
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prises operation sub-units 18 and 19 located symmetri- 
cally on the right and left sides, a select switch 22 and 
a start switch 23 located between the operation sub- 
units 18 and 19, operation sub-units 24 and 25 located 
in front of the operation sub-units 1 8 and 1 9 respectively 
and a connector 26 and a cable 27 for connecting the 
operation unit 17 to the entertainment system main unit 
2. 

Fig. 5 is a diagram showing a typical internal elec- 
trical configuration of the entertainment system main 
unit 2. As shown in the figure, the entertainment system 
main unit 2 has a main bus 41 and a sub-bus 42 which 
are connected to each other by a sub-bus interface 
(SBUSIF) 43. 

Connected to the main bus 41 are a main CPU ( 
central processing unit) 44 (a list generating means) im- 
plemented by components such as a microprocessor 
and a VPEO (a first vector processing engine) 71 , a main 
memory 45 implemented by a RAM (random access 
memory), a main DMAC (main direct memory access 
controller) 46 (a data transferring means), an MDEC 
(MPEG (Moving Picture Experts Group) decoder) 47, a 
VPE1 (a second vector processing engine) 48. In addi- 
tion, a GPU (graphical processing unit) 49 is also con- 
nected to the main bus 41 through a GPUIF (graphical 
processing unit interface) 72. A CRTC (CRT controller) 
84 is provided on the GPU 49. In addition, a frame mem- 
ory 58 is connected to the GPU 49. 

On the other hand, connected to the sub-bus 42 are 
a sub-CPU 50 implemented by components such as a 
microprocessor, a sub-memory 51 implemented by a 
RAM, a sub-DMAC 52, a ROM 53 for storing programs 
such as an operating system, an SPU (sound process- 
ing unit) 54, a communication control unit (ATM) 55, a 
CD-ROM drive 56 serving also as the disc mounting 
sub-unit 3 cited earlier and an input unit 57. The con- 
nector terminal 12Aof the input unit 57 is connected to 
the connector terminal portion 26 of the operation unit 
17 as described earlier. 

Connected to both the main bus 41 and the sub-bus 
42, the SBUSIF 43 passes on data coming from the 
main bus 41 to the sub-bus 42 and, in the contrary, for- 
wards data coming from the sub-bus 42 to the main bus 
41. 

When the entertainment system main unit 2 is acti- 
vated, the main CPU 44 fetches out instructions of an 
activation program from the ROM 53 connected to the 
sub-bus 42 by way of the SBUSIF 43, executing the in- 
structions of the activation program in order to activate 
the operating system. 

In addition, the main CPU 44 issues a request to 
read data to the CD-ROM drive 56 in order to acquire 
data and an application program from the CD-ROM 40 
mounted on the CD-ROM drive 56, loading the applica- 
tion program into the main memory 45. 

Furthermore, in conjunction with the first vector 
processing engine (VPEO) 71, the main CPU 44 gener- 
ates data for non-type processing, that is, polygon def- 
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inition information, from data of a 3-dimensional object 
comprising a plurality of basic figures such as polygons 
read out from the CD-ROM 40. An example of the data 
of a 3-dimensional object is coordinate values of verti- 

5 ces or representative points of a polygon. The VPEO (the 
first vector processing engine) 71 has a plurality of 
processing elements for processing floating point real 
numbers and is thus capable of carrying out pieces of 
floating point processing in parallel. 

10 To put it in detail, the main CPU 44 and the VPEO 
(the first vector processing engine) 71 carry out geom- 
etry processing that entails detailed operations in poly- 
gon units. An example of such processing is generation 
of data of a polygon which represents a swinging state 

*s of a leaf of a tree blown by a wind or a state of drops of 
rain hitting the front window of a car. Then, vertex infor- 
mation found from the processing and polygon definition 
information such as shading mode information are sup- 
plied to the main memory 45 as packets by way of the 

20 main bus 41. 

The polygon definition information comprises draw- 
ing area setting information and polygon information. 
The drawing area setting information includes offset co- 
ordinates in the frame memory 58 of a drawing area, 

25 that is, a frame memory address of the drawing area, 
and coordinates of a drawing clipping area for canceling 
an operation to draw a drawing range indicated by a pol- 
ygon with coordinates thereof existing outside the draw- 
ing area. On the other hand, the polygon information in- 

30 dudes polygon attribute information and vertex informa- 
tion. Here, the polygon attribute information is informa- 
tion used for specifying a shading mode, an ALPHA 
blending mode and a texture mapping mode. On the oth- 
er hand, the vertex information is information on coordi- 

35 nates in a vertex drawing area, coordinates in a vertex 
texture area and the color of a vertex, to mention a few. 

Much like the first processing engine (VPEO) 71 , the 
second vector processing engine (VPE1) 48 has a plu- 
rality of processing elements for processing floating 

40 point real numbers and is thus capable of carrying out 
pieces of floating point processing in parallel. The VPE1 
48 is capable of generating an image in accordance with 
operations carried out by using the operation unit 1 7 and 
matrix operations. That is to say, the second vector 

45 processing engine (VPE1) 48 generates data (polygon 
definition information) for type processing, that is, for 
processing that is relatively simple enough to be carried 
out by execution of a program on the VPE1 48. Exam- 
ples of such processing which is carried out by the see- 
so ond vector processing engine (VPE1) 48 are radios 
copy conversion for an object having a simple shape 
such as a building or a car, parallel light source calcu- 
lation and generation of a 2-dimensional curved surface. 
Then, the polygon definition information generated by 

55 the VPE1 48 is supplied to the GPUIF 72. 

Controlled by the main CPU 44, the GPUIF 72 also 
receives polygon definition information from the main 
memory 45 through the main bus 41. For this reason, 
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the GPUI F 72 adjusts processing timing so as to prevent 
the polygon definition information originated by the main 
CPU 44 from colliding with the polygon definition infor- 
mation supplied thereto by the second vector process- 
ing engine 48, passing on them to the GPU 49. 5 

The GPU 49 draws an image expressing a 3-dimen- 
sional object using a polygon based on the polygon def- 
inition information supplied thereto by way of the GPUIF 
72 on the frame memory 58. An image drawn using a 
polygon for expressing a 3-dimensional object is re- 10 
ferred to hereafter as a polygon image. Since the GPU 
49 is capable of using the frame memory 58 also as a 
texture memory, the GPU 49 can carry out texture map- 
ping processing to stick a pixel image in the frame mem- 
ory 58 on a polygon as a texture. 15 

The main DMAC 46 is used for controlling, among 
other operations, DMA transfers to and from a variety of 
circuits connected to the main bus 41. In addition, de- 
pending on the state of the SBUSIF 43, the main DMAC 
46 is also capable of controlling, among other opera- 20 
tions, DMA transfers to and from a variety of circu its con- 
nected to the sub-bus 42. The MDEC 47 operates con- 
currently with the main CPU 44, decompressing data 
which has been compressed in accordance with an 
MPEG (Moving Picture Experts Group) system or a 25 
JPEG (Joint Photographic Experts Group). 

The sub-CPU 50 carries out various kinds of 
processing by execution of programs stored in the ROM 
53. The sub-DMAC 52 controls, among other opera- 
tions, DMA transfers to and from a variety of circuits con- 30 
nected to the sub-bus 42 only when the SBUSIF 43 is 
disconnected from the main bus 41 and the sub-bus 42. 

The SPU 54 reads out sound data from a sound 
memory 59, outputting the sound data as an audio sig- 
nal in accordance with a sound command received from 35 
the sub-CPU 50 or the sub-DMAC 52. The output audio 
signal is then supplied to a speaker 202 by way of an 
amplifier circuit 201 to be finally output by the speaker 
202 as sound. 

The communication control unit (ATM) 55 is con- 40 
nected to a public communication line or the like and 
used for transmitting and receiving data through the line. 

The input unit 57 comprises the connector terminal 
12A for connecting the operation unit 17 to the enter- 
tainment system main unit 2, a video input circuit 82 for 45 
supplying video data coming from other apparatuses not 
shown in the figure to the entertainment system main 
unit 2 and an audio input circuit 83 for supplying audio 
data coming from the other apparatuses to the enter- 
tainment system main unit 2. so 

Fig. 6 is a block diagram showing a detailed config- 
uration of the main DMAC 46, the main CPU 44, the 
main memory 45, the second vector processing engine 
(VPE1) 48 and the GPU 49 shown in Fig. 5. 

As shown in Fig. 6, the main CPU 44 comprises a s $ 
CPU core (CORE) 94, an instruction cache (l$) 95, a 
scratch pad RAM (SPR) 96, a data cache (D$) 97 and 
the first vector processing engine (VPE0) 71 . The CPU 
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core 94 executes predetermined instructions. The in- 
struction cache 95 is used for temporarily storing in- 
structions to be supplied to the CPU core 94. The 
scratch pad RAM 96 is used for storing results of 
processing carried out by the CPU core 94. Finally, the 
data cache 97 is used for temporarily storing data to be 
used in the execution of processing by the CPU core 94. 

The first vector processing engine (VPE0) 71 com- 
prises a micromemory (microMEM)98, an FMAC (Float- 
ing Multiple Adder Calculation) unit 99, a divisor (DIV) 
100, a functional unit 101 referred to as a VU-MEM and 
a packet expander (PKE) 102. The VU-MEM 101 in- 
cludes a floating point vector processor unit (VU) and 
an embedded memory (MEM). The floating point vector 
processor unit executes 64-bit micro instructions of a mi- 
croprogram stored in the micromemory 98 to be de- 
scribed more later in order to process data stored in in- 
ternal registers of the VU and the embedded memory. 

The PKE 102 expands microcode supplied thereto 
in accordance with control executed by a main DMAC 
109 to be described more later into microinstructions to 
be stored as a microprogram in the micromemory 98 
and to be executed by the VU, and expands a packet of 
packed data also supplied thereto in accordance with 
the control executed by the main DMAC 1 09, storing the 
expanded packet into the embedded memory (MEM) 
employed in the VU-MEM 101. The FMAC ( Floating 
Multiple Adder Calculation) unit 99 executes floating 
point processing whereas the divider (DIV) 100 carries 
out division. As described above, the first vector 
processing engine (VPE0) 71 is embedded in the main 
CPU 44 which carries out non-type processing in con- 
junction with the VPE0 71 . 

Much like the first vector processing engine (VPE0) 
71, the second vector processing engine (VPE1) 48 
comprises a micromemory (microMEM) 103, an FMAC 
(Floating Multiple Adder Calculation) unit 104, a divisor 
(DIV) 106, afunctional unit 107 referred to as a VU-MEM 
and a packet expander (PKE) 108. The VU-MEM 107 
includes a floating point vector processor unit (VU) and 
an embedded memory (MEM). The floating point vector 
processor unit executes 64-bit micro instructions of a mi- 
croprogram stored in the micromemory 103 to be de- 
scribed more later in order to process data stored in in- 
ternal registers of the VU and the embedded memory. 

The PKE 108 expands microcode supplied thereto 
in accordance with control executed by the main DMAC 
46 into microinstructions to be stored as a microprogram 
in the micromemory 103 and to be executed by the VU, 
and expands a packet of packed data also supplied 
thereto in accordance with the control executed by the 
main DMAC 46, storing the expanded packet into the 
embedded memory (MEM) employed in the VU-MEM 
107. The FMAC (Floating Multiple Adder Calculation) 
unit 104 executes floating point processing whereas the 
divider (DIV) 1 06 carries out division. The second vector 
processing engine 48 carries out type processing on da- 
ta supplied thereto from the main memory 45 and sup- 
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plies results of the processing to the GPUIF 72 by way 
of the GPU 49. 

The main memory 45 is used for storing data of a 
3-dimensional object and, when necessary, supplies the 
data to the first vector processing engine 71 and the sec- 
ond vector processing engine 48. A display list created 
jointly by the main CPU 44 and the first vector process- 
ing engine (VPEO) 71 is stored temporarily in a Memory 
FIFO (MFIFO) embedded in the main memory 45 before 
being supplied to the GPUIF 72 by way of the main bus 
41 . The reason why the display list is stored temporarily 
in the memory FIFO is that the main CPU 44 and the 
first vector processing engine 71 each have a process- 
ing priority lower than that of the second vector process- 
ing engine 48, making it necessary to keep the display 
list in the memory FIFO till the second vector processing 
engine 48 enters an idle state. 

In addition, the main CPU 44 and the first vector 
processing engine (VPEO) 71 jointly creates a matrix to 
be processed by the second vector processing engine 
48, storing the matrix in the main memory 45. Then, the 
second vector processing engine 48 makes a display 
list by using the matrix. 

In order to process a display list for non-type 
processing supplied from the first vector processing en- 
gine 71 by way of the GPUIF 72 and a display list for 
type processing supplied from the second vector 
processing engine 48, the GPU 49 holds a graphic con- 
text (that is, a drawing setting condition) of, among other 
things, a drawing offset and a clip range to be referred 
to at a drawing time for each of the display lists. A no- 
tation CGO used in the following description denotes a 
graphic context for non-type processing while a notation 
CG1 denotes a graphic context for type processing as 
will be described more later. 

For example, as described above, the PKE 102 ex- 
pands microcode supplied to the first vector processing 
engine 71 from the main memory 45 by way of the main 
bus 41 in accordance with control executed by the 
DMAC 109 into microinstructions to be stored as a mi- 
croprogram in the micromemory 98 and to be executed 
by the VU, and expands a packet of packed data such 
as data of a 3-dimensional object also supplied thereto 
from the main memory 45 by way of the main bus 41 in 
accordance with the control executed by the DMAC 1 09, 
storing the expanded packet into the embedded mem- 
ory (MEM) employed in the VU-MEM 101. Then, the 
FMAC 99 and the DIV 100 carry out pieces of process- 
ing such as matrix processing, transformation of coor- 
dinates and radios copy conversion on the data of the 
3-dimensional object. At that time, complex processing 
is also performed in conjunction with the CPU core 94. 
The processing typically results in a display list for draw- 
ing a swinging state of a leaf of a tree blown by a wind 
or a state of drops of rain hitting the front window of a car. 

A display list (complex stream) for drawing a 2-di- 
mensional object created in this way on a screen is 
stored temporarily in the MFIFO of the main memory 45 
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through the main bus 41 before being supplied finally to 
the GPUIF 72. 

On the other hand, as described above, the PKE 
108 expands microcode supplied to the second vector 

5 processing engine 48 from the main memory 45 by way 
of the main bus 41 in accordance with control executed 
by the main DMAC 46 into microinstructions to be stored 
as a microprogram in the micromemory 103 and to be 
executed by the VU, and expands a packet of packed 

10 data such as data of a 3-dimensional object also sup- 
plied thereto from the main memory 45 by way of the 
main bus 41 in accordance with the control executed by 
the main DMAC 46, storing the expanded packet into 
the embedded memory (MEM) employed in the VU- 

15 MEM 107. Then, the FMAC 104 and the DIV 106 carry 
out pieces of processing such as matrix processing, 
transformation of coordinates and radios copy conver- 
sion on the data of the 3-dimensional object. Based on 
a matrix and a graphic context created jointly by the 

20 main CPU 44 and the first vector processing engine 71 
and supplied to the second vector processing engine 48 
from the main memory 45 by way of the main bus 41, 
the processing is relatively simple type processing. 
A display list (simple stream) for drawing a 2-dimen- 

25 sional object created in this way on a screen is supplied 
finally to the GPUIF 72 by way of the main bus 41 . The 
two streams, that is, the complex and simple streams, 
are then transferred to the GPU 49 on a time division 
basis by arbitration. 

30 The GPU 49 executes drawing processing based 
on the display lists supplied thereto by the GPUIF 72, 
drawing polygons on the frame memory 58. If the display 
list is a display list created jointly by the main CPU 44 
and the first vector processing engine 71 on the main 

35 memory unit 45 and then supplied to the GPU 49 by way 
of the main bus 41, the GPU 49 executes the drawing 
processing by using the graphic context GC0 cited ear- 
lier. If the display list is a display list created by the sec- 
ond vector processing engine 48, on the other hand, the 

40 GPU 49 executes the drawing processing by using the 
graphic context GC1 cited before. 

A polygon drawn on the frame memory 58 is con- 
verted into an output video signal for the polygon in ac- 
cordance with control executed by the CRTC 84. 

45 Fig. 7 is a diagram showing timing with which the 
two display lists are processed. Geometry Subsystem 0 
shown in Fig. 7 corresponds to the second vector 
processing engine 48 shown in Fig. 6 whereas Geom- 
etry Subsystem 1 corresponds to the main CPU 44 and 

50 the first vector processing engine 71 . A rendering sub- 
system corresponds to the GPU 49. It should be noted 
that a hatched portion shown in the figure indicates that 
a task indicated by a task name is in an idle state. 
Fig. 7A is a diagram showing a processing proce- 
ss dure for a case in which only one processor, that is, Ge- 
ometry Subsystem 0, exists. In this case, Geometry 
Subsystem 0 makes a display list (List #0-1), supplying 
the list to the rendering system. Then, Geometry Sub- 
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system 0 continues making display lists following List 
#0-1 , that is, List #0-2 and subsequent display lists. The 
rendering system executes drawing processing in ac- 
cordance with the display list (List #0-1 ) supplied thereto 
from Geometry Subsystem 0. If Geometry Subsystem 0 
is still making the next display list (List # 0-2) at the point 
of time the rendering subsystem completes the drawing 
processing in accordance with List #0-1 , the rendering 
subsystem enters an idle state, waiting for Geometry 
Subsystem 0 to complete the creation of the next display 
list (List #0-2) and supply the list to the rendering sub- 
system. 

Thereafter, much like what has been described 
above, if Geometry Subsystem 0 has not completed the 
processing of making a next display list yet at a point of 
time the rendering subsystem completes the drawing 
processing in accordance with the current display list, 
the rendering subsystem enters an idle state, waiting for 
Geometry Subsystem 0 to supply the next list to the ren- 
dering subsystem. 

Fig. 7B is a diagram showing a processing proce- 
dure for a case in which two processors, that is Geom- 
etry Subsystem 0 and Geometry Subsystem 1 , exist. In 
this case, while Geometry Subsystem 0 is making a dis- 
play list (List #0-1), the rendering subsystem would be 
put in an idle state. For this reason, data associated with 
a display list (List #1-1) already created by Geometry 
Subsystem 1 and stored in the main memory 45 is sup- 
plied to the rendering subsystem. Receiving the first dis- 
play list (List #1-1) created by Geometry Subsystem 1, 
the rendering subsystem executes drawing processing 
based on a graphic context for Geometry Subsystem 1 
appended to the first display list (List #1-1) supplied to 
the rendering system by Geometry Subsystem 1. 

When Geometry Subsystem 0 completes the 
processing to create the first display list (List #0-1 ), Ge- 
ometry Subsystem 1 is supplying a next display list (List 
#1-2) to the rendering system. At that time, Geometry 
Subsystem 1 is forced to halt the operation to supply the 
next display list (List #1-2) to the rendering subsystem. 
Thus, Geometry Subsystem 0 can now supply the com- 
pleted the first display list (List #0-1) to the rendering 
subsystem and start to make the next display list (List 
#0-2). Receiving the first display list (List #0-1 ) from Ge- 
ometry Subsystem 0, the rendering subsystem carries 
out drawing processing based on the first display list 
(List #0-1). 

When the rendering subsystem completes the 
drawing processing based on the first display list (List 
#0-1), Geometry Subsystem 0 is still making the next 
display list (List #0-2). For this reason, Geometry Sub- 
system 1 resumes the suspended operation to supply 
the next display list (List #1 -2) to the rendering subsys- 
tem. Otherwise, the rendering subsystem would enter 
an idle state. Receiving the next display list (List #1-2) 
created by Geometry Subsystem 1, the rendering sub- 
system starts execution of drawing processing based on 
the next display list (List #1-2). 



Thereafter, much like what has been described 
above, Geometry Subsystem supplies a display list cre- 
ated thereby only when Geometry Subsystem 0 is still 
making a display list, putting the rendering subsystem 
s in an idle state. As a result, display lists made by a plu- 
rality of processors can be efficiently processed by the 
rendering subsystem. 

In order to execute the coordinate transformation 
processing faster, for example, a subprocessor or a co- 
10 ordinate transformation coprocessor can be provided 
separately from the CPU (the VU mentioned earlier) in 
each of a plurality of vector processing engines which 
share a common drawing unit, that is, the GPU 49, and 
each output display lists to the GPU 49. By employing 
such a subprocessor or a coprocessor, the CPU (proc- 
essor) in each vector processing engine is now capable 
of supplying display lists more frequently to the shared 
GPU 49. Thus, the GPU 49 has to be switched from one 
processor to another at short time intervals. Otherwise, 
an overflow will occur in a local memory provided for 
each processor. For this reason, a priority level is as- 
signed to each processor, that is, to Geometry Subsys- 
tem 0 and Geometry Subsystem 1 , as shown in Fig. 7B. 
When there is no more display list to be transferred to 
the GPU 49 from a master processor, that is, a CPU hav- 
ing the highest priority or the CPU employed in the sec- 
ond vector processing engine 48 in the case of the in- 
formation processing apparatus shown in Fig. 6 or Ge- 
ometry Subsystem 0 shown in Fig. 7, the right to make 
an access to the GPU 49 is handed over to a slave proc- 
essor, that is, a CPU having a priority second to the mas- 
ter CPU 44 or the CPU (the VU) employed in the first 
vector processing engine 71 in the case of the informa- 
tion processing apparatus shown in Fig. 6 or Geometry 
Subsystem 1 shown in Fig. 7. 

As soon as the master processor completes the 
processing of making a display list and gets prepared 
for transferring the display list to the GPU 49, the slave 
processor is forced to return the right to access the GPU 
49 to the master processor even if a display list still re- 
mains to be completed and transferred by the slave 
processor to the GPU 49. 

In general, the master processor is capable of car- 
rying out processing at a high speed but has a local 
memory with a relatively small storage capacity. On the 
other hand, a slave processor carries out processing at 
a relatively low speed but has a local memory with a 
relatively large storage capacity. 

There is also an information processing apparatus 
in which a processor 2 serving as a slave to a slave proc- 
essor 1 is further connected as shown in Fig. 8. In such 
an information processing apparatus, a processor with 
a specially low priority needs a local memory with an 
even greater storage capacity for storing more display 
lists. For this reason, a low priority is normally assigned 
to a main processor which is provided with a main mem- 
ory. In this way, the main processor also serves as a 
slave processor as well. 
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In drawing processing carried out by the GPU 49, 
in addition to vertex information described in a display 
list, environment parameters or drawing setting condi- 
tions referred to as a graphic context such as a drawing 
offset and a clip range at a drawing time are also re- 
quired as described above. The rendering subsystem 
(that is, the GPU 49) carries out drawing processing 
based on display lists supplied by each Geometry Sub- 
system (that is, the CPU) in accordance with a graphic 
context for the Geometry Subsystem. When the display 
list supplier is switched from one Geometry Subsystem 
to another, however, a lot amount of work to newly set 
a graphic context needs to be done. In order to solve 
this problem, the rendering subsystem holds as many 
graphic contexts as Geometry Subsystems. 

A graphic context is added to a display list as shown 
in Fig. 7 typically for each object supplied to the GPU 
49 to be drawn thereby. As a result, the GPU. 49 is ca- 
pable of carrying drawing processing for each object on 
the basis of a graphic context associated with the object. 

Geometry Subsystems and the rendering subsys- 
tem share the main bus 41 which comprises a data bus 
and an address bus. A Geometry Subsystem that is ac- 
cessing the rendering subsystem transmits the I D of the 
Geometry Subsystem and a display list created by the 
Geometry Subsystem to the rendering subsystem 
through the address bus and the data bus respectively. 
Receiving the ID and the display list, the rendering sub- 
system selects a graphic context corresponding to the 
ID and interprets the display list on the basis of the 
graphic context. The rendering subsystem then draws 
an image on the frame buffer. 

By letting a plurality of processors (vector process- 
ing engines or Geometry Subsystems) control a GPU 
49 (rendering subsystem) on a priority basis as de- 
scribed above, the storage capacity of a local memory 
provided in each of the processors for temporarily stor- 
ing a display list made in the processor can be reduced 
to a minimum. As a result, it is possible to carry out 
processing to make display lists in parallel in the proc- 
essors without increasing the cost of local memories. In 
addition, by - holding a graphic context for each of the 
processors in the GPU 49 (the rendering subsystem), 
the number of duplicated data transfers, that is, the 
amount of overhead work, to be carried out during con- 
text switching can be reduced. 

Strictly speaking, the processors share the data bus 
and, hence, the main memory on a time sharing basis. 
The following is description of a technique of controlling 
data in accordance with a meta-instruction embedded 
in the data itself during a transfer of the data to the GPU 
49. 

Fig. 9 is a diagram showing a typical format of a 
meta-instruction. A meta-instruction is an instruction 
added in front of data to be transferred. A meta-instruc- 
tion prescribes the length of the transferred data, a des- 
tination of the data transfer and the operation code of 
the meta-instruction. A meta-instruction comprises 128 



bits, only 64 bits of them shown in the figure are valid. 
The size of data to be transferred is specified in the first 
16-bit field QWC. The operation code of this meta-in- 
struction occupies a field from the 24th bit to the 31st 
5 bit. A field from the 32nd bit to the 63rd bit is used for 
specifying an address which this data to be transferred 
is stored at or a meta-instruction is to be read out next 
from. 

The transfer of data is controlled in accordance with 
10 the operation code of a meta-instruction embedded in 
the data as follows. 

If the operation code is "cnt" , after as many words 
of data as specified by the QWC field following this me- 
ta-instruction have been transferred, a meta-instruction 
'5 stored at an address following this packet (that is, the 
meta-instruction and the data) is executed by the proc- 
essor. If the operation code is "cnts" , after as many 
words of data as specified by the QWC field following 
this meta-instruction have been transferred by execut- 
ing stall control, a meta-instruction stored at an address 
following this packet is executed by the processor. If the 
operation code is "next" , after as many words of data 
as specified by the QWC field following this meta-in- 
struction have been transferred, a meta-instruction 
stored at an address specified in the address field is ex- 
ecuted by the processor. 

The stall control is timing control carried out by a 
processor itself to put an access made by the processor 
to the main memory 45 in a wait state till an access to 
the main memory 45 made by another processor is com- 
pleted. 

If the operation code is "REF° , after as many words 
of data as specified by the QWC field stored at an ad- 
dress ADDR specified in the address field have been 
transferred, a meta-instruction stored at an address fol- 
lowing this meta-instruction is executed by the proces- 
sor. If the operation code is "rets" , after as many words 
of data as specified by the QWC field stored at an ad- 
dress ADDR specified in the address field have been 
transferred by executing stall control, a meta-instruction 
stored at an address following this meta-instruction is 
executed by the processor. As described above, a 
"REF" meta-instruction is used for transferring data with 
a length specified in the QWC field from an address 
specified in the address field 

If the operation code is "call" , after as many words 
of data as specified by the QWC field following this me- 
ta-instruction have been transferred, an address follow- 
ing this packet is pushed (or loaded) into a register as 
a return address and a meta-instruction stored at an ad- 
dress specified in the address field of the meta-instruc- 
tion is executed by the processor. If the operation code 
is "ref , after as many words of data as specified by the 
QWC field following this meta-instruction have been 
transferred, a meta-instruction stored at an address 
popped (or read out) back from the register is executed 
by the processor. It should be noted that the address 
has been pushed to the register as a return address dur- 
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ing execution of a meta-instruction with the "CALL" op- 
eration code associated with this "RET* meta-instruc- 
tion. If the operation code is "end" , after as many words 
of data as specified by the QWC field following this me- 
ta-instruction have been transferred, the processing is 
ended. 

Fig. 10 is a diagram used for explaining operations 
which are carried out by the processor when the oper- 
ation code of meta-instructions is "next" which means 
that the next data following this meta-instruction is to be 
transferred. First of all, the main DMAC 46 reads out a 
1 word as a meta-instruction word from an address 
ADDRO stored in a Tag Address register Dn_TADR. As- 
sume that the meta-instruction is "NEXT, ADDR = 
ADDR2, LEN = 8" which means that the operation code 
is "next" , the QWC field specifies that the length of the 
data to be transferred is 8 qwords (quadlet words) where 
1 qword is 128 bits and ADDR2 is an address specified 
in the address field. Thus, in the execution of the meta- 
instruction "NEXT, ADDR = ADDR2, LEN = 8" , 8-qword 
data is transferred. Then, a meta-instruction "NEXT, AD- 
DR = ADDR1, LEN = 2" stored at the address ADDR2 
is executed. 

By the same token, in the execution of the meta- 
instruction "NEXT, ADDR = ADDR1 , LEN = 2" , 2-qword 
data is transferred under control executed by the main 
DMAC 46. Then, a meta-instruction "END, ADDR = - , 
LEN = 8" stored at an address ADDR1 is executed. In 
the execution of the meta-instruction "END, ADDR = - , 
LEN = 8" , 8-qword data is transferred. Then, the 
processing is ended. 

Fig. 11 is a diagram used for explaining operations 
which are carried out by the processor when the oper- 
ation code of meta-instructions is "REF" . First of all, the 
main DMAC 46 reads out a 1 word as a meta-instruction 
word from an address ADDRO stored in the tag address 
register Dn_TADR. Assume that the meta-instruction is 
"REF, ADDR = ADDR2, LEN = 2° . In the execution of 
the meta-instruction "REF, ADDR = ADDR2, LEN = 2" 
2-qword data stored at an address ADDR2 is trans- 
ferred. Then, a meta-instruction " REF, ADDR = ADDR1 , 
LEN = 8' following this meta-instruction is executed. 

In the execution of the meta-instruction "REF, AD- 
DR = ADDR1 , LEN = 8" , 8-qword data stored at an ad- 
dress ADDR1 is transferred. Then, a meta-instruction 
"END, ADDR = - , LEN = 8" following this meta-instruc- 
tion is executed. In the execution of the meta-instruction 
"END, ADDR = - , LEN = 8" , 8-qword data is transferred. 
Then, the processing is ended. 

Fig. 12 is a diagram used for explaining operations 
which are carried out by the processor when the oper- 
ation codes of meta-instructions are "CALL" and "RET" . 
First of all, the main DMAC 46 reads out a 1 word as a 
meta-instruction word from an address ADDRO stored 
in the tag address register Dn_TADR. Assume that the 
meta-instruction is "CALL, ADDR = ADDR1, LEN = 0" . 
(In the execution of the meta-instruction "CALL, ADDR 
= ADDR1, LEN = 0° , no data following this meta-instruc- 



tion is transferred because LEN = 0. The address of a 
meta-instruction "CALL, ADDR = ADDR2, LEN = 8" fol- 
lowing this meta-instruction is pushed into a first register 
as a return address.) After the execution of the meta- 

5 instruction "CALL, ADDR = ADDR1 , LEN = 0", a meta- 
instruction "CALL, ADDR = ADDR2, LEN = 8 B shown on 
the right side of Fig. 1 2 and stored at an address ADDR1 
is executed. In the execution of the meta-instruction 
"CALL, ADDR = ADDR2, LEN = 8" , 8-qword data fol- 

10 lowing this meta-instruction is transferred(, pushing the 
address of a meta-instruction "RET, ADDR = - , LEN = 
0 " following this packet into a second register as a return 
address.). Then, a meta-instruction "RET, ADDR = - , 
LEN = 8" stored at an address ADDR2 is executed. 

is in the execution of the meta-instruction "RET, AD- 
DR = - , LEN = 8" , 8-qword data following this meta- 
instruction is transferred. Then, the meta-instruction " 
RET, ADDR = - , LEN = 0" , the address of which was 
pushed in the second register in the execution of the 

20 meta-instruction "CALL, ADDR = ADDR2, LEN = 8" 
shown on the right side, is executed. In the execution of 
the meta-instruction "RET, ADDR = - , LEN = 0" , no data 
following this meta-instruction is transferred because 
LEN = 0. Then, the meta-instruction "CALL, ADDR = 

25 ADDR2 , LEN = 8" on the left side, the address of which 
was pushed in first the register in the execution of the 
meta-instruction " CALL, ADDR = ADDR1 , LEN = 0" , is 
executed. In the execution of the meta-instruction 
°CALL, ADDR = ADDR2, LEN = 8" on the left side, 

so 8-qword data following this meta-instruction is trans- 
ferred(, pushing the address of a meta-instruction "END, 
ADDR = - , LEN = 0" following this packet into the first 
register as a return address.). Then, the meta-instruc- 
tion "RET, ADDR = - , LEN = 8" stored at ADDR2 is ex- 

35 ecuted. 

In the execution of the meta-instruction "RET, AD- 
DR = - , LEN = 8" , 8-qword the data following this meta- 
instruction is transferred. Then, the meta-instruction 0 
END, ADDR = - , LEN = 0" , the address of which was 

40 pushed in the first register in the execution of the meta- 
instruction "CALL, ADDR = ADDR2, LEN = 8" on the left 
side, is executed. In the execution of the meta-instruc- 
tion "END, ADDR = - , LEN = 0" , no data is transferred 
because LEN = 0. Then, the processing is ended. 

^5 As described above, a transfer of data is controlled 
in accordance with a meta-instruction embedded in the 
data. 

Fig. 13 is a diagram showing a state in which a 
transfer of data is controlled in accordance with a meta- 

so instruction embedded in the data. While the main CPU 
44 is making a display list (Display List #0), data asso- 
ciated with a display list (Display List #1) preceding Dis- 
play List #0 by 1 frame is transferred to the second vec- 
tor processing engine (VPE1) 48. 

55 First of all, the main CPU 44 in a conjunction with 
the first vector processing engine 71 makes a display 
list (Display List #0) comprising a meta-instruction with 
the " NEXT" operation code, a context, a meta-instruc- 
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tion with the "REF" operation code, a meta-instruction 
with the " REF" operation code, a matrix, a meta-instruc- 
tion with the "REF" operation code, a matrix, a meta- 
instruction with the "REF" operation code, a matrix, a 
meta-instruction with the "REF" operation code, a meta- 5 
instruction with the "REF" operation code, a meta-in- 
struction with the " REF" operation code, a matrix and 
a meta-instruction with the "RET" operation code as 
shown in Fig. 13. 

As described above, while the main CPU 44 is mak- 10 
ing a display list (Display List #0) in conjunction with the 
first vector processing engine 71 , data associated with 
a display list (Display List #1) preceding Display List #0 
by 1 frame is transferred to the second vector process- 
ing engine (VPE1) 48 as follows. First of all, a meta- is 
instruction with the "NEXT" operation code at the head 
of Display List #1 is executed to transfer the subsequent 
context to the second vector processing engine 48. 
Then, a first meta-instruction with the "REF" operation 
code following the transferred context is executed to 20 
read out Program 0 stored in an object data base in the 
main memory 45 (and then transfer the program to the 
second vector processing engine 48). For example, a 
vertex data set, that is, the contents of an object shown 
in Fig. 13 can be stored somewhere and only matrices 25 
of a display list are updated. It is thus possible to gen- 
erate an image based on the observer's eyes. In this 
way, data with the contents thereof do not vary from 
frame to frame such as a program is read out from a 
display list by using a meta-instruction (to transfer the 30 
data to the second vector processing engine 48 without 
the need to include the data on the display list). Fixed 
data such as characters and constant data between 
them can be shared by meta-instructions in different dis- 
play lists. As a result, a display list can be made by up- 35 
dating only positional data (that is, matrices) included 
on the display list with ease, the contents of which vary 
from frame to frame, of an existing display list made pre- 
viously. 

It should be noted that the object data base includes 40 
3-dimensional data for describing a 3-dimensional ob- 
ject (also referred to hereafter as Vertex of Object) and 
a program for interpreting the object data. In addition, if 
texture mapping is carried out on an ornament of an ob- 
ject, image data used as a texture (referred to as a tex- 45 
ture image) is also stored in the object data base. 

Then, a second meta-instruction with the "REF" op- 
eration code following the above first "REF" meta-in- 
struction is executed to read out 3-dimensional coordi- 
nate data Vertex of Object 0, that is, vertex coordinates so 
of Object 0 (and then transfer the vertex coordinates of 
Object 0 to the second vector processing engine 48). 
Then, a first matrix is transferred to the second vector 
processing engine 48 (by execution of a meta-instruc- 
tion with the NEXT operation code following the second 55 
"REF" meta-instruction at the head of the first matrix). 
Subsequently, a third meta-instruction with the "REF" 
operation code following the transferred first matrix is 
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executed to read out 3-dimensional coordinate data Ver- 
tex of Object 1 , that is, vertex coordinates of Object 1 
(and then transfer the vertex coordinates of Object 1 to 
the second vector processing engine 48). 

Then, a second matrix following the above third " 
REF" meta-instruction is transferred to the second vec- 
tor processing engine 48 by execution of a meta-instruc- 
tion with the NEXT operation code following the third 
"RFF" meta-instruction at the head of the second matrix. 
Subsequently, a fourth meta-instruction with the "REF" 
operation code following the transferred second matrix 
is executed to again read out the 3-dimensional coordi- 
nate data Vertex of Object 1 , that is, vertex coordinates 
of Object 1 (and then transfer the vertex coordinates of 
Object 1 to the second vector processing engine 48). 
Then, a third matrix is transferred to the second vector 
processing engine 48 by execution of a meta-instruction 
with the NEXT operation code following the fourth "RFF" 
meta-instruction at the head of the third matrix. Subse- 
quently, a fifth meta-instruction with the "REF" operation 
code following the transferred third matrix is executed 
to read out Program 3 (and then transfer the program to 
the second vector processing engine 48). Then, a sixth 
meta-instruction with the "REF" operation code follow- 
ing the above fifth "REF" instruction is executed to read 
out texture image data from the frame memory 58 and 
transfer the data to the second vector processing engine 
48. 

If the texture image data is not stored in the frame 
memory 58 yet, the texture image data is transferred to 
the frame memory 58 before object data Vertex of Ob- 
ject 4, is transferred by the following seventh meta-in- 
struction with the "REF" operation code. If the texture 
image data is thawing data coming from the MDEC 47 
or transferred data coming from the sub-bus 42, the tex- 
ture image data varies from frame to frame. In this case, 
a stall function is used to establish synchronization of 
the data transfer as will be described later. 

While image data is being transferred to the second 
vector processing engine 48, processing carried out by 
the second vector processing engine 48 is suspended 
temporarily. It is thus necessary to minimize the transfer 
period of image data by halting activities performed by 
other DMA channels during this period. Halting of activ- 
ities carried out by other DMA channels can be specified 
by a predetermined control bit in a meta-instruction that 
is used for transferring image data. For example, the 
24th and 25th bits of a meta-instruction shown in Fig. 9 
are used as such control bits. 

Then, the last (seventh) meta-instruction with the 
"REF" operation code is executed to read out 3-dimen- 
isonal coordinate data Vertex of Object 4 (and then 
transfer the data to the second vector processing engine 
48). Then, the last (fourth) matrix is transferred to the 
second vector processing engine 48 (by execution of a 
meta-instruction with the NEXT operation code follow- 
ing the seventh "RFF" meta-instruction at the head of 
the fourth matrix). Finally, a meta-instruction with the 
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"RET" operation code is executed to end the transfer 
processing. 

Fig. 14 is a diagram used for explaining the stall 
control cited above. Assume that data is transferred 
from Device 0 to a main memory and then from the main s 
memory to Device 1 in an increasing order of data stor- 
age addresses in the main memory. In this case, an ad- 
dress in the main memory from which data is to be trans- 
ferred to Device 1 may at one time exceed an address 
in the main memory to which data has been transferred 10 
from Device 0 least recently for some reasons. During 
such a time, the transfer of the data from the main mem- 
ory to Device 1 is put in stalled state. 

In the example of data transfers shown in Fig. 13, 
texture image data is transferred from a storage memory is 
in the MDEC 47 to the main memory 45 and then from 
the main memory to the second vector processing en- 
gine 48 in an increasing order of data storage addresses 
in the main memory 45. In this case, an address in the 
main memory 45 from which data is to be transferred to 20 
the second vector processing engine 48 may at one time 
exceed an address in the main memory 45 to which data 
has been transferred from the storage memory in the 
MDEC 47 least recently for some reasons. During such 
a time, the transfer of the texture image data from the 25 
main memory 45 to the second vector processing en- 
gine 48 is put in a stalled state to establish transfer syn- 
chronization. 

As described above, the main DMAC 46 fetches a 
meta-instruction from a display list and executes the me- 30 
ta- instruction in order to distribute data to processors. 
As a result, by programming the order and the form or 
the priority of transferring data in the data at the time a 
display list is created by the processor in advance, the 
data can be transferred in an optimum way in depend- 35 
ence on characteristics of the data. In addition, by hav- 
ing the processor prescribe the order of transferring data 
in the form of a list such as a display list in advance, it 
is not necessary for the processor to hold wasteful cop- 
ied data for the transfer work in a memory. As a result, 40 
the number of wasteful accesses to the memory and the 
size of the display list is reduced. 

Moreover, it is necessary to store only each of data 
portions to be transferred that vary from frame to frame 
separately at 2 locations for individual display lists. Any 45 
portion of display lists that does not vary from frame to 
frame can be stored in a memory area common to all 
the display lists. Thus, the size of a memory required for 
storing display lists can be reduced. That is to say, a 
number of display lists can be stored in a memory with so 
a small storage capacity. 

On the top of that, since data is transferred in ac- 
cordance with a meta-instruction embedded in the data, 
synchronization of operations to read out and write data 
can be established among a plurality of processors with 55 
ease. As a result, a plurality of processors are allowed 
to share a memory without the need to provide a double 
buffer in the memory. 
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In the case of the embodiment described above, da- 
ta is stored in a CD-ROM. It should be noted, however, 
that other recording media can also be used as well. 

In the information processing apparatus embodying 
the first aspect of the invention, the list generating 
means is used for generating a list including data for 
3-dimensional graphics processing to be transferred to 
a processing unit and an instruction for controlling the 
transfer of the data to the processing unit. As a result, 
by programming, among other things, the order or the 
priority of transferring data in the data in advance, the 
data can be transferred in an optimum way in depend- 
ence on characteristics of the data. 

In the information processing apparatus embodying 
the second aspect of the invention, the memory is used 
for storing a list including data for 3-dimensional graph- 
ics processing to be transferred to a processing unit and 
an instruction for controlling the transfer of the data to 
the processing unit. As a result, data can be processed 
in an optimum way in dependence on characteristics of 
the data in accordance with an instruction embedded in 
the data in advance. 

In the information processing apparatus embodying 
the third aspect of the invention, the data transferring 
means is used for reading out a list including data for 
3-dimensional graphics processing to be transferred to 
a processing unit and an instruction for controlling the 
transfer of the data to the processing un it from a memory 
and used for transferring the data to the processing unit 
in accordance with the instruction. As a result, the mem- 
ory resource can be allocated to processors with a high 
degree of efficiency in accordance with the data. 

In the information processing apparatus embodying 
the fourth aspect of the invention and the information 
processing method embodying the fifth aspect of the in- 
vention, the list generating means is used for generating 
a list including data for 3-dimensional graphics process- 
ing to be transferred to a processing unit and an instruc- 
tion for controlling the transfer of the data to the process- 
ing unit and used for storing said list in a memory, and 
the data transferring means is used for reading out the 
list from the memory and used for transferring the data 
on the list to the processing unit in accordance with the 
instruction included on the list. As a result, by program- 
ming, among other things, the order or the priority of 
transferring data in the data in advance, the data can be 
transferred in an optimum way in dependence on the 
data. 



Claims 

1. An information processing apparatus wherein: 

a bus and a memory are shared on a time divi- 
sion basis by a plurality of processing units for 
carrying out 3-dimensional graphics process- 
ing in parallel; 
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said information processing apparatus provid- 
ed with a list generating means for generating 
a list including data for said 3-dimensional 
graphics processing to be transferred to said 
processing units and an instruction for control- 5 
ling transfers of said data to said processing 
units. 

2. An information processing apparatus wherein: 

10 

a bus and a memory are shared on a time divi- 
sion basis by a plurality of processing units for 
carrying out 3-dimensional graphics process- 
ing in parallel; 

and said memory is used for storing a list in- 1 $ 
eluding data for said 3-dimensional graphics 
processing to be transferred to said processing 
units and an instruction for controlling transfers 
of said data to said processing units. 

20 

3. An information processing apparatus wherein: 

a bus and a memory are shared on a time divi- 
sion basis by a plurality of processing units for 
carrying out 3-dimensional graphics process- 25 
ing in parallel; 

said information processing apparatus provid- 
ed with a data transferring means used for 
reading out a list including data for said 3-di- 
mensional graphics processing to be trans- 30 
ferred to said processing units and an instruc- 
tion for controlling transfers of said data to said 
processing units from said memory and used 
for transferring said data to said processing 
units in accordance with said instruction. 35 
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being transferred to any one of said processing 
units in accordance with an instruction on said list, 
said data transferring means puts an access to said 
data made by any other one of said processing units 
in a wait state. 

7. An information processing apparatus according to 
claim 4 wherein: 

a copy of only a portion of said list that varies 
from frame to frame is made and said copy is stored 
in either said first area or said second area of said 
memory whereas said original portion is stored in 
said second area or said first area of said memory. 

8. An information processing apparatus wherein: 

a bus and a memory are shared on a time divi- 
sion basis by a plurality of processing units for 
carrying out 3-dimensional graphics process- 
ing in parallel; 

said information processing apparatus provid- 
ed with a list generating means used for gener- 
ating a list including data for said 3-dimensional 
graphics processing to be transferred to said 
processing units and an instruction for control- 
ling transfers of said data to said processing 
units and used for storing said list in said mem- 
ory; and 

provided with a data transferring means used 
for reading out said list from said memory and 
used for transferring said data on said list to 
said processing units in accordance with said 
instruction included on said list. 

9. An information processing method whereby: 



4. An information processing apparatus according to 
claim 2 wherein: 

said memory is virtually divided into a first ar- 
ea for storing a predetermined number of said lists 40 
for predetermined frames and a second area for 
storing the rest of said lists for other frames. 

5. An information processing apparatus according to 
claim 3 wherein: 45 

when an address in an area of said memory 
from which a piece of data is to be read out by any 
one of said processing units becomes greater than 
an address in said area of said memory at which 
another one of said processing units wrote another so 
piece of data most recently, said data transferring 
means puts an operation to read out said piece of 
data carried out by said one of said processing units 
in a wait state. 

55 

6. An information processing apparatus according to 
claim 3 wherein: 

while data of a list stored in said memory is 



a bus and a memory are shared on a time divi- 
sion basis by a plurality of processing units for 
carrying out 3-dimensional graphics process- 
ing in parallel; 

said information processing method compris- 
ing the steps of: 

generating a list including data for said 
3-dimensional graphics processing to be 
transferred to said processing units and an 
instruction for controlling transfers of said 
data to said processing units and storing 
said list in said memory; 
reading out said list from said memory; and 
transferring said data on said list to one of 
said processing units in accordance with 
said instruction included on said list. 
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